Disentangled Variational Autoencoder for Emotion Recognition in Conversations

نویسندگان

چکیده

In Emotion Recognition in Conversations (ERC), the emotions of target utterances are closely dependent on their context. Therefore, existing works train model to generate response utterance, which aims recognise leveraging contextual information. However, adjacent generation ignores long-range dependencies and provides limited affective information many cases. addition, most ERC models learn a unified distributed representation for each lacks interpretability robustness. To address these issues, we propose VAD-disentangled Variational AutoEncoder (VAD-VAE), first introduces utterance reconstruction task based Autoencoder, then disentangles three affect representations Valence-Arousal-Dominance (VAD) from latent space. We also enhance disentangled by introducing VAD supervision signals sentiment lexicon minimising mutual between distributions. Experiments show that VAD-VAE outperforms state-of-the-art two datasets. Further analysis proves effectiveness proposed module quality representations. The code is available at https://github.com/SteveKGYang/VAD-VAE.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multi-Level Variational Autoencoder: Learning Disentangled Representations from Grouped Observations

We would like to learn a representation of the data that reflects the semantics behind a specific grouping of the data, where within a group the samples share a common factor of variation. For example, consider a set of face images grouped by identity. We wish to anchor the semantics of the grouping into a disentangled representation that we can exploit. However, existing deep probabilistic mod...

متن کامل

Using denoising autoencoder for emotion recognition

In this paper, we propose to use the denoising autoencoder to generate robust feature representations for emotion recognition. In our method, the input of the denoising autoencoder is the normalized static feature set (state-of-the-art features for emotion recognition). This input is mapped to two hidden representations: one is to capture the neutral information from the input, and the other on...

متن کامل

Tied Variational Autoencoder Backends for i-Vector Speaker Recognition

Probabilistic linear discriminant analysis (PLDA) is the de facto standard for backends in i-vector speaker recognition. If we try to extend the PLDA paradigm using non-linear models, e.g., deep neural networks, the posterior distributions of the latent variables and the marginal likelihood become intractable. In this paper, we propose to approach this problem using stochastic gradient variatio...

متن کامل

Variational Lossy Autoencoder

Representation learning seeks to expose certain aspects of observed data in a learned representation that’s amenable to downstream tasks like classification. For instance, a good representation for 2D images might be one that describes only global structure and discards information about detailed texture. In this paper, we present a simple but principled method to learn such global representati...

متن کامل

Quantum Variational Autoencoder

Variational autoencoders (VAEs) are powerful generative models with the salient ability to perform inference. Here, we introduce a quantum variational autoencoder (QVAE): a VAE whose latent generative process is implemented as a quantum Boltzmann machine (QBM). We show that our model can be trained end-to-end by maximizing a well-defined loss-function: a “quantum” lowerbound to a variational ap...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Affective Computing

سال: 2023

ISSN: ['1949-3045', '2371-9850']

DOI: https://doi.org/10.1109/taffc.2023.3280038